منابع مشابه
16-899C ACRL Tetris Reinforcement Learner
Our approach to this problem was to use reinforcement learning with a function approximator to approximate the state value function [RSS98]. In our case, a +1 reward was given for every completed line, so that the value function would encode the long-term number of lines that is going to be completed by the algorithm. In order to achieve this, we extract features from the game state, and use gr...
متن کاملEX Α FS and Near Edge Structure IV July 7 - 11 , 1986
A d s o r p t i o n and thermal decomposition of IMi(CG)^ i n t h e cage system o f z e o l i t e Y have been s t u d i e d u i t h EXAFS, e l e c t r o n microscopy and IR s p e c t r o s c o p y , Ni(CO)^ i s adsorbed as an i n t a c t molecule i n both c a t i o n f r e e z e o l i t e Y and NaY. Symmetry changes of the molecule i n NaY are assigned t o the f o r m a t i o n o f Na —OC-IMi b...
متن کاملThe July 1986 Oceanside (ml = 5.3) Earthquake Sequence in the Continental Borderland, Southern California by Egill Hauksson and Lucile
An earthquake of M, = 5.3 occurred at 32°58.7'N, 117°51.5'W southwest of Oceanside in San Diego County at 13:47 13 July 1986 (UT). This main shock was followed by an extensive aftershock sequence, with 55 events of ML > 3.0 during July 1986. The epicenters of the main shock and aftershocks are located at the northern end of the San Diego Trough-Bahia Soledad fault zone (SDT-BS) where it changes...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: College & Research Libraries News
سال: 2020
ISSN: 2150-6698,0099-0086
DOI: 10.5860/crln.47.9.589